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paper, the path integral formula for the fundamental solution of the FPKfe is derived and 
verified for the general additive noise case (i.e., explicitly time-dependent state model and 
with state-independent rectangular diffusion vielbein). The solution is universal in the sense 
that the initial distribution may be arbitrary. The practical utility is demonstrated via some 
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1. Introduction 

In a wide variety of applications, the evolution of a state, or a signal of interest, is described 
by a stochastic dynamical model. That is, the state of the system is described by a noisy 
version of a deterministic dynamical system termed the state model. If the state process is 
continuous (discrete), then the dynamics is governed by a system of first-order differential 
equations (difference equations) in the state variable (x(t)) with an additional contribution 
due to random noise (iv(t)). The noise in the state model is referred to as the signal noise. 
If the noise is Gaussian (or more generally, multiplicative Gaussian) the state process is a 
Markov process. 

However, in many applications the signal model cannot be directly observed. Instead, 
what is measured is a nonlinear ly related stochastic process (y(i)) called the measurement 
process. The measurement process can often by modelled as yet another stochastic dynamical 
system called the measurement model. That is, the observations, or measurements, are drawn 
from a different system of noisy first order differential (difference) equations. The noise in the 
measurement dynamical system is referred to as measurement noise. The nonlinear filtering 
problem is to estimate the state of a stochastic dynamical system, given the observations 
of a related stochastic measurement process. For an excellent discussion of the subject, see 
Jazwinski[Q]. 

The conditional probability density function of the state parameters, given the observa- 
tions, is the complete solution of the filtering problem. It contains all the probabilistic infor- 
mation about the state process that is in the measurements and the initial condition. This 
is the Bayesian approach; i.e., the a priori initial data about the signal process contained in 
the initial probability distribution of the state{u{tQ, x)) is incorporated in the solution. Given 
the conditional probability density, optimality may be defined under various criteria. Usually, 
the conditional mean, which is the least mean-squares estimate, is studied due to its richness 
in results and mathematical elegance. The solution of the optimal nonlinear filtering problem 
is termed universal, if the initial distribution can be arbitrary. 

The signal and measurement processes may be discrete-time or continuous-time stochastic 
processes and there is a different type of filtering problem for each possible combination of 
continuous (discrete) signal and measurement processes. In this paper, the measurement 
process is described by a discrete-time stochastic process and the underlying signal process 
is a continuous-time stochastic process. 

The linear filtering problem, i.e., where the state and measurement processes are linear, 
was investigated by Kalman and Bucy[^, |^. For a modern review of the subject, see [§]. 
The Kalman filter has been successfully applied to a large number of problems. 

In spite of its enormous success, the Kalman filter suffers from some major limitations. 
From a Bayesian perspective, the simplicity of the Kalman filter for the linear state model 
is because it is based on the assumption that the initial distribution is Gaussian. Then, 
the probability distribution function is completely characterized by the mean and covariance 



- 2 - 



matrices^ . The Kalman filter is still optimal if the initial distribution is not Gaussian only 
under certain criteria, such as minimum variance, but not under an arbitrary loss function. In 
other words, the Kalman filter is not a universal optimal filter even when the filtering problem 
is linear. In the general nonlinear case, the filter state is infinite dimensional, namely, the 
whole conditional probability distribution function, and the Kalman filter cannot be optimal 
and will fail, although it may still work quite well if the nonlinearity is mild enough. 

It is well known that the evolution of the probability distribution of the state vari- 
able is governed by the Fokker-Planck-Kolmogorov equation (FPKfe). The solution of the 
continuous-discrete filtering problem also requires the solution of a FPKfe, since the probabil- 
ity density between measurements evolves according to the FPKfe (see, for instance, Q). The 
FPKfe is a linear, parabolic, partial differential equation(PDE)^. The literature on methods 
to solve the FPKfe type of equations is vast. In a broad sense, there are three main types of 
methods: finite difference methods spectral methods |0 and finite element methods|p. 

However, there are several difficulties in solving partial differential equations. A naive 
discretization may not be convergent; i.e., the approximation error may not vanish as the grid 
size is reduced. Alternatively, when the discretization spacing is decreased, it may tend to a 
different equation; i.e., be inconsistent. Furthermore, it may be that the numerical method is 
unstable. Finally, since the solution of a FPKfe is a probability density, it must be positive. 
This may not be guaranteed by the discretization of the FPKfe. 

A different approach was taken by Shing-Tung Yau and Stephen Yau in [^. A more 
general equation than the FPKfe, one which arises in the continuous-continuous nonlinear 
filtering problem with additive noise (and with no explicit time dependence), was studied 
in 1^. It was shown that the formal solution may be written as an ordinary, but somewhat 
complicated, multi-dimensional integral, where the integrand is an infinite series. In addition, 
they also presented an estimate of the time needed for the solution to converge to the true 
solution. Since this FPKfe is independent of the measurements, it can be solved off-line. 

In this paper, it is shown that the fundamental solution of the FPKfe, and hence the 
continuous-discrete filtering problem, can be solved in terms of a Feynman path integral for 
the general additive noise state model. Specifically, the transition probability probability 
density is expressed as a Feynman path integral. Thus, the solution of the FPKfe is reduced 
to the computation of a path integral. Although the path integral is the limit of a large- 
dimensional integral, it shall be seen that the path integrand is considerably simpler than the 
integrand in Q (e.g., no convolution or infinite series). 

Path integrals first arose in the study of quantum physics. Inspired by an observa- 



tion/remark of Dirac[|l^, 11], Feynman discovered the path integral representation of quantum 
physics |l^. Until the mid 1960s, path integrals were thought to be merely a curiosity, and 
not a serious alternative to the operator methods in quantum physics. The importance of path 



^Such a simple characterization of the probability distribution and solution is possible only for a limited 
class of filtering problems called finite dimensional filters. 

■^Sometimes in the literature the term 'nonlinear FPKfe' is used to denote the FPKfe corresponding to a 
state process with a nonlinear drift. However, this FPKfe is still a linear, parabolic PDE. 
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integrals grew when Fadeev and Popov used the path integrals to derive Feynman rules for 
the non-Abelian gauge theories. Its status was further enhanced when 't Hooft and Veltman 
used path integral methods to prove that the Yang-Mills theories were renormalizable. Since 
then, the path integral formulation of quantum field theory has led to extraordinary insights 
into quantum field theory, such as renormalization and renormalization group, anomalies, 
skyrmions, monopoles, instantons, and supersymmetry (see, for instance, the freely available 
text [0]). Much of the advance in the past 40 years in particle theory would not have been 
possible without the path integral. The path integral has also led to numerous insights into 
mathematics, such as the quantum field theory representation of the knots invariants, instan- 
tons and monopoles in supersymmetric gauge theories and the Donaldson-Seiberg-Witten 
invariants of four-manifolds, and more recently, S-duality in supersymmetric gauge theories 
and the Laglands program in mathematics. For a rigorous account of path integral methods, 
see m. 

The path integral representation is especially interesting from a computational point of 
view. A classic example is lattice quantum chromodynamics (QCD) which is the lattice 
formulation of the quantum field theory of strong interactions. Observables in quantum 
field theory require path integration over quantum fields whose dynamics is described by 
highly nonlinear infinite-dimensional systems. The path integral method has been used to 
perform the nonperturbative computation of quantities in quantum chromodynamics (see, 
for instance, fl^). The comparison to experimental results, even at coarse resolutions, with 
lattice computations is excellent. 

The following point needs to be emphasized to readers familiar with the discussion of 
standard filtering theory. In filtering theory literature, it is the Feynman-Kac formalism that 
is used. The Feynman-Kac formulation is rigorous and has led to several rigorous results in 
filtering theory. However, in spite of considerable effort it has not been proven to be useful 
in the development of practical algorithms. It also obscures the physics of the problem. In 
contrast, it is shown that the Feynman path integral leads to formulas that are eminently 
suitable for numerical implementation. It also provides a simple and clear physical picture. 
Finally, the theoretical insights provided by the Feynman path integral are highly valuable, 
as evidenced by numerous examples mentioned above. 

The path integral solution of the FPKfe has been studied in various branches of theoreti- 
cal physics for different purposes. This is because the FPKfe (and the Langevin equation) is a 
fundamental equation of statistical physics. A textbook discussion of the FPKfe is presented 
in and [|^. The results derived in this paper generalize some of the results in [|^]. Specif- 
ically, the path integral formula for the fundamental solution is derived and independently 
verified for the case when the drift in the state model is explicitly time-dependent, the diffu- 
sion vielbein is an explicitly time dependent rectangular matrix, and when the noise process 
is colored with a time dependent covariance matrix. In addition, the discussion here improves 
upon the discussion in ||l^ by clarifying important and subtle aspects of the derivation of 
the path integral formulas and by providing more details so that this powerful technique is 
accessible to a wider audience. 
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The outline of this paper is as follows. In Section ^, the Langevin equation, the cor- 
responding Fokker-Planck-Kolmogorov forward equation, and the role the FPKfe plays in 
solving the continuous-discrete filtering problem, are reviewed. In Section ^, the meaning of 
the Gaussian path integral measure is clarified and some important properties of the Langevin 
equation are reviewed. In Section ^, two derivations are presented of the path integral formula 
for the fundamental solution for the simpler case; i.e., where the drift and diffusion vielbein 
are not explicitly time dependent and the noise is white and additive. This is likely to be the 
case of most interest in practise. In Section ||, this is generalized to the case when the drift 
and diffusion vielbein are explicitly time dependent and the additive noise is colored. In Sec- 
tion ^ it is independently verified that the derived path integral formula does indeed satisfy 
the FPKfe. In the following section, some numerical examples are presented. This is followed 
by a discussion of the physical meaning of the path integral formula and a comparison to the 
solution with that obtained by S-T. Yau and S. S-T. Yau. 

In the appendices, the tools used in the derivations are reviewed including Gaussian inte- 
gration, the imposition of delta function constraints as an integral, and aspects of functional 
calculus. Although most of these topics are discussed in modern quantum field theory texts, 
it has been included in order to make the paper completely self-contained. 

2. Continuous-Discrete Filtering: A Review 

2.1 The Fokker-Planck-Kolmogorov forward Equation 

In continuous-discrete filtering theory, the state model is given by the stochastic differential 
equation of the form 

dx{t) = f{x{t),t)dt + e{x{t),t)d\i{t). (2.1) 
Here x(t) and f(x{t),t) are n-dimensional column vectors, e(x{t),t) is an n x p matrix and 



v(t) is a p— dimensional Wiener process column vector. In physics literature. Equation 2.1 is 
written as follows: 

^ = /(x(t),t)+e(x(t),tMi). (2.2) 

The stochastic process v{t) is assumed to be Gaussian with zero mean and "5— correlated" , 
i.e., 

{u{t)) = 0, (2.3) 

where Q{t) is a pxp covariance matrix. The quantity / is referred to as the (signal model) drift 
and e as the diffusion vielbein, and the quantity eQe^ is referred to as the diffusion matrix. 
The stochastic processes are represented in bold and one of its samples by its corresponding 
plain symbol. In the interest of clarity, the tensor indices are suppressed in this subsection. 
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The expectation with respect to Gaussian noise is denoted by angular brackets. Equation 
is also referred to as the Langevin equation with multiplicative noise and 5— correlated 
Langevin force. When e = e{t), i.e., the diffusion vielbein is independent of the state variable, 
the noise in the Langevin equation is termed additive noise. 

Consider an ensemble of dynamical systems with state variables evolving according to 
the Langevin equation. Due to random noise, each system leads to a different vector x{t) 
that depends on time. Although only one realization of the stochastic process is observed, 
it is meaningful to consider an ensemble average. For fixed times t = ti,i = 1,2,... ,r, 
the probability density of finding the random vector x(t) in the (n— dimensional) interval 
Xi < x(tj) < Xi + dxi{l < i < r) is given by 

Wr{tr,xr;--- ;ti,xi) = <5" (x(ti) - x^) ^ , (2.4) 

where Xj is an n— dimensional column vector. The complete information on the random 
vector x{t) is contained in the infinite hierarchy of such probability densities. The quantity 
of interest here is the conditional probability density^ 

P{tri Xr\tr—lj Xr—1, • • • ]ti, Xi) = {6 (x(tr) x{tr))) \x{tr-i)=Xr-i,...,x{ti)=xi ) x(tf.) = Xf 

(2.5) 

_ Wr{tr,Xr] ■ ■ ■ ;tl,Xi) 
/ Wr{tr,Xr; ■ ■ ti,Xi)d'^Xr 

The process described by the Langevin equation with 5— correlated Langevin force is a 
Markov process; i.e., the conditional probability density depends only on the value at the 
immediate previous time: 

P{tri Xr\tr—li Xf—\, . . . , , Xi ) — P(tni Xn | 2;^— l) • (^'6) 

This implies that 



Wr{tr,Xr; . . • ;tl,Xi) 



V-1 



P{ti+i,Xi+l\ti,Xi) 



.i=l 



Wi{h,xi). (2.7) 



Hence, the complete information for a Markov process is contained in the transition proba- 
bility densities 

^[t2,X2\tl,Xi) - — — . 

Wi[ti,Xi) 

From the identity 

W2{t3,Xs;ti,Xi) = J W3{t3,X3;t2,X2;ti,Xi){(f'x2}, (2.9) 



^Unless otherwise specified, all integration variables are from — cxd to 00. 
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and using the Markov property Equation 2.9 may be written as 



P{h,X3\h,Xi)Wiih,Xi) = j P{t3,X3\t2,X2)P{t2,X2\ti,Xi)Wi{ti,Xi){(Px2]. (2.10) 

Since Wi{ti,xi) is arbitrary, the Chapman-Kolmogorov equation results: 

P(t3,X3|tl,Xi) = j P{t3,Xs\t2,X2)Pit2,X2\tl,Xi){rX2}. (2.11) 

This is also referred to as the Chapman-Kolmogorov semi-group property of the transition 
probability density. This property plays a fundamental role in arriving at the path integral 
formula. 

It can be shown that the state probability distribution functionp(t, x) = J P{t, x\t'x')p(t', x'){(i"'x'} 
satisfies the Fokker-Planck-Kolmogorov forward equation (FPKfe) (see for instance, |T7|]): 



j^{t,x) = -Y, — [fi{x{t),t)p{t,x)] + - Y.Y.Q^. Hx{t),t)Q{t)\i, — {elk{x{t),t)p{t,x)) 

i=l * i,k=l b=l * 

(2.12) 

= ^p{t,x). 

The diffusion matrix is clearly a symmetric matrix and is also positive semi-definite: 
Dijaittj > 0. Often, Dij is assumed to be positive-definite, in which case the inverse of the 
diffusion matrix exists. An important property of the FPKfe is that it is a continuity equation. 

2.2 Continuous-Discrete Filtering: Model and Solution 

In this paper, it is assumed that the dynamics is given by a continuous-time process and 
observations are samples of a discrete-time process. The continuous-time state model is 



described by Equation |2.1| . The measurement model is described by the following discrete- 
time stochastic process 

y(tfc) =/i(x(tfc),tfc)+w(tfc), k = l,2,..., tfc+i>ifc>0, (2.13) 

where y{t) £ W^^^, h E R'"^'^ and the noise process is described by w(t) ~ N{0, R{t)). As 
discussed below, the Gaussian assumption in the measurement model can be relaxed. 

Let the initial distribution be o"o(x) and let the measurements be collected at time instants 
ti,t2, ■ ■ ■ ,tk, ■ ■ ■■ We use the notation Y{t) = {y{ti) ■ to < ti < t} . Prior to incorporating the 
measurements, the state evolves according to the FPKfe; i.e., 

^(t, x\Y{to)) = ^{p{t, x\Y{to))), to<t< h, (2.14) 
p{to,x\Y{to)) = cro(x). 

This is the 'prediction' step. 
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From Bayes' rule (and using p(ti, x\Y{ti)) = p{ti,x\y{ti),Y{ti^i)) at observation ti, the 
'corrected' conditional density at time ti is 

r>(h x\Y(h )) - Piyiti)\^)Pih,^\Y{to)) 

This is then the initial condition of the FPKfe for the next prediction step which results in 
^(t, x\Y{ti)) = if (p(t, x\Y{ti))), ti<t< t2, (Prediction Step), (2.16) 

and so on. Thus, at observation at time t^, the conditional density is given by 
/, ivf. ^^ piy{tk)\x)p{tk,x\Y{tk^i)) 

where p{y{tk)\x) is given by 

P{yitk)\x) = - — ^ — exp I -^ivitk) - h{x{tk),tk)Y'{R{tk)y^{y{tk) - h{x{tk),tk)) 

((27r)'»deti?(tfc)) ^ I 2 

(2.18) 

and p{tk,x\Y{tk-i)) is given by the solution of the PDE 

—p{t, x\Y{tk-i)) = ^{p{t, x\Y{tk-i))), tk-i <t<tk, (2.19) 

with initial condition p{tk-i, x\Y (tk-i))- Note that the measurement model noise need not 
be Gaussian; the Gaussian assumption merely specifies the form of p{y{tk)\x) to be given by 
Equation |2.1S . 

Thus, the non-trivial part of continuous-discrete filtering is the solution of a FPKfe. 
Note that the FPKfe is independent of measurements, and hence can be computed off-line. 
Furthermore, observe that the complete information is in the transition probability density 
which also satisfies the FPKfe except with a <5— function initial condition (known as the 
fundamental solution, or kernel, of the FPKfe): 

—pit,x\T,y)=^{p{t,x\T,y)), t>T, (2.20) 
p(t,x\t,y) = 6{x - y). 

This is because from the transition probability density the probability for an arbitrary initial 
condition can be computed as follows: 

)p{tn-i-,x'){(F-x'}. (2-21) 
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Thus, it is sufficient to solve the FPKfe for x|t, y) and use the above to compute x\Y{tk~i)) 
using 



p{tk,x\Y{tk^i)) = J p{tk,x\tk-ux')p{tk-i,x'\Y{tk-i)){d''x'}. (2.22) 
Alternatively, a convenient set of basis functions may be used. Then, the evolution of 



each of the basis functions under the FPKfe follows from Equation 2.21. Since the basis 



functions are independent of measurements, the computation may be performed ofF-line|19]. 
Note that the solution is universal; ie., the initial distribution can be arbitrary. 



In conclusion, the solution of Equation |2.20| is equivalent to the solution of the universal 
optimal nonlinear filtering problem. A solution in terms of ordinary integrals was presented 
in Q. Here, the solution in terms of Feynman path integrals is presented. 

3. Some Preliminary Remarks 

The reader may refer to the appendices for the derivation of the results used here. 
3.1 Gaussian Path Integral Measure 



The Gaussian noise process f (t), defined in Equation 2^, may be represented by a path 
integral measure 



[dp(i/(0)] = [^z^(t)] exp 



a, 6=1 



9X1 



(3.1) 



where v^t) is a real vector for each t. The meaning of Equation 3.1 is explained next. 

Recall that the probability measure of a n— dimensional Gaussian vector v with zero 
mean and covariance matrix Q is 

The first moment or mean is 



(3.2) 



VdetQ \2tt J 



p/2 



(3.3) 



0, 



by virtue of the odd symmetry of the integrand. The second moment is easily seen to be 
(e.g., from discussion in Appendix ^ 



/ {dJ'u} VcUd exp ( -i Ea,fe=l VaQah^b 



I {d''''} exp (-^ Ea,fe=l ''aQab'^b 
Qab- 



(3.4) 



- 9 - 



It is straightforward to generalize the above for the case of a sequence of random vectors 
which are uncorrelated with each other at different times: 

X exp --^ ^ Va{tk)Qj{tk)Vh{tk) I = Qab{tk)6kl- 
y k=la,b=l J 

Note that the noise may be colored. It is also important to note that had uncorrelatedness at 
different times not been assumed, the exponent would not be a sum of quadratic terms local 
in time. 

For the continuum limit, the following expression is of interest (in the notation of Ap- 
pendix |A[ a{t) = = At) 

/ {n 7mm i^S" *'''^**^"} '''' 

( At ^ " _ \ 1 

X exp ^ XaiQ ^{tk))abXb = -^Qab{tk)6kl- 

\ k=l a,b=l J 

The measure in the continuum limit (i.e.. At 0) is formally written as 

[dp{um ^ [^z.(t)]exp I -i I Mt){Q-\bn{t)dt\ , (3.7) 



where 

^ r / 1 \ «/2 -I ^ 
i V2vrAty ^det Q(tfc) J 



Hence, in the continuum limit, the mean is easily seen to be and Equation 3.6 becomes 

{Ua{t)Mt')) =Qab{mt-t'). (3.9) 

These results may also be directly derived using the methods of functional calculus(see 
Appendix In particular, it is straightforward to verify that the mean is zero due to odd 
symmetry (setting a = 1/2) 

{u,{t)) = I [dp{u)]u,{t), (3.10) 
= 0, 
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and the variance is Q{t): 

{v,{t)ud{t')) = I [dp{i^)]u,{t)ud{t'), (3.11) 



'diy{t)]iJ,{t)ud{t')ew i-\JdtY^ Va{t){Q \t))abiyb{t) 



a,fe=l 



6bc{t) 5hd{t) 
Qcdit)5{t-t'), 



exp I -i I dt' Ut')Qab{t')h{t') 



a,6=l 



b=0 



and higher-order moments can easily be written down using Wick's theorem. The results are 
in accord with the expectation that the measure represents a Gaussian process with these 
first two moments. 

3.2 Basic Properties of the Langevin Equation 

We conclude this section with some fundamental properties of the Langevin Equation. 

• Time Translation invariance: When the drift does not depend on time explicitly, i.e., 
/(x(t)), then the process is time-translation invariant; i.e., does not depend on the 
origin of time. Alternatively, the time translation operator ^ commutes with the FPK 
operator 



$ = - = 0, (3.12) 

dt dt ' ^ ^ 



where $ is an arbitrary differentiable function. 

• Locality: Langevin equation is a differential equation specifying the time evolution as 
a system of first order stochastic differential equations with all terms evaluated at the 
same time. In other words, the Langevin equation is local in time. 

• Noise uncorrelated at different times: This is an important assumption which leads to 
the (5— correlated in time property of the noise process. 

From the locality of the Langevin equation and the uncorrelatedness of noise at different 
times, it immediately follows that P satisfies the Chapman-Kolmogorov semi-group property 
(Equation |2.9| ), an important property of Markov processes: 

P(t3,X3|tl,Xi) = j Pit3,Xs\t2,X2)P{t2,X2\tl,Xi){d''x2}. (3.13) 

This implies that the distribution P is completely determined from its knowledge for small 
time intervals. It is important to note that if the Langevin equation was not local and/or the 
noise was not uncorrelated at different times, the Chapman-Kolmogorov semi-group property 
would not be valid^. 

^However, if noise is correlated at different times, some non-Markovian processes (for instance, exponentially 
correlated) may be reduced to Markovian processes. 
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4. Path Integral Formula I: Implicit Time Dependence 

In this section, the noise is assumed to be additive, white, and Gaussian-distributed. The 
state model is given by 

dx{t) = f{x{t))dt + dy,{t), x(t),/(x(t)),v(t) gM", Q^J = hJ^j. (4.1) 

Observe that the drift is not explicitly time-dependent, and the diffusion vielbein is pro- 
portional to the identity matrix with proportionality constant h^. In this instance, the Ito 
and Stratanovich forms are the same. A sample of this stochastic process is the differential 
equation 

Xi{t) = fi{x{t)) + Ui{t), i = l,2,...,n, (4.2) 
where is the sample of a Gaussian noise process with (^—correlated covariance matrix h^I. 
4.1 Formal Derivation Using Functional Calculus 

The transition probability density, {6[x{t) — x]) ^^\x(to)=xoJ is now derived using functional 



methods for state model given by the Langevin equation given by Equation 4.1. From the 



path integral representation of the Gaussian noise in Equation |3.1| , it follows that 

[x(t) - x])^ U.(t„)=,„ = j [dpii^)] r [xit) - x] Ut,)=,„ x{t) = f{x{t)) + i^it). (4.3) 
The condition that x{t) in the integrand satisfies the Langevin equation needs to be imposed. 



This can be done using the identity Equation |C.1[)| as follows: 

(n 

X I [^x(0] J6^±{t)-f{x{t))-u{t)), 
X (^"kli) -a;]U(to)=xo, 

where 

5 



'ufm] (4.4) 
'to 



n , 

J = det ( 



bXj (t 



^{xi{t)-U{x{t))-vm\- (4.5) 



Interchanging the order of integration and then performing the trivial integration over v{t) 
leads to 

P(t,x|to,a;o) = y [^^Wlexp^-^f^y {±^{1) - h{x{t)f dt\^ J S^\x{t) - x\\^(t,-^=^,. 

(4.6) 
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It remains to compute the Jacobian and convert the path integral to one over x{t). The 
Jacobian J follows from noting that 



5xj{t') 



[x^{t) - Mxit)) - ,.,{t)] 



S{t-t'), 



(4.7) 



dt 



6,^6{t-t')-e{t-t')^{x{t')) 



because 



—0(t-t') = 6{t-t'), and 



Sxi{t) 
6xj{t') 



The Jacobian of the transformation is then given by 



J = det 



5,^6{t-t')-e{t-t')-^{x{t')) 



(4.8) 



(4.9) 



det (^1) det (^6,,6{t - t') - e{t - t')^{x{t'))^ 



= yKdet (6ij6{t - t') - 9{t - t')^{x{t')) 

CJJb j 

where ^ is an irrelevant constant. From the identity det A = exp(trlnA), and using the 

lnJ = lndet 5i^5{t - t') - OH - t')^{x{t') , (4.10) 



identity in Equation C.12, it follows that 



5..5(t-t')-e{t-t')^{x{t') 



to 



i=l 



i=l " 
" ft 



to 



dxi 



'x{t))dt. 



Since the symmetric or Feynman convention is used here^, 0(0) = ^. It also ensures that the 
operations of averaging and differentiation commute (see, for instance, |^]). Also, 

[&x{t)]^ |[^^(i)]det(^) 

where the irrelevant constant (lei{d/ dt)6{t — t') has been absorbed into the measure. 
Combining these results leads to the following path integral formula: 



det ( j^5{t -t')] I [!^x{t)] 



Mt)], (4.11) 



P{t,x\to,xo) 



x{i)=x 



x(to)=xo 



[Qx{t)\ exp 



1 

'2K, 



" rt 



i=l 



to 



{±,{t) - h{x{t))f + K^{x{t)) 

uXi 



(4.12) 



^The Stratanovich convention symmetrizes the argument of the diffusion vielbein, while the Feynman 
convention symmetrizes the argument of the drift. 
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4.2 Derivation using the Chapman-Kolmogorov Semi-group property 

Consider the time evolution of the state in an infinitesimal time interval. Integrating the 
Langevin equation (Equation 4.1) in an infinitesimal time interval {t,t + e) yields 

Xi{t + e) = Xi{t) + e/,(x(t)) + J'^' u,{T)dT + 0{e^l^). (4.13) 

To order e, the continuum Langevin equation is equivalent to the discretized Langevin equa- 
tion: 

Xi(t + e) = Xi{t) + e/i(x(t)) + ^€Ui{t), (4.14) 

where 

[■t+e 

VeMt) = ^■.{T)dT. (4.15) 
From the properties of Vi{t), it follows that 

{^eui{t))= j'\v,{T))dT, (4.16) 



0, 



and 



e {V,{t)v,{t')) = n^'^' ^\,{T)v,{T')dTdT'\ , (4.17) 



t+e ft'+e 

{vi{T)vj{T')) drdr', 

t+e ft'+e 

hu6ij6{T — T')dTdT' , 

'o t^t', 
ehydij t = t' 



Therefore, 



{Ui{t)) = 0, (4.18) 
{ui{t)i>j{t')) = hJijSw, 

which implies that P{t) is also a (discrete-time) Gaussian process. 

Consider the quantity P{t + e, x\t, x'). The Fourier transform P of P w.r.t. x is 

P{t + e,p\t,x') = J {d''x}exp l^-iY^piXi^ P{t + e,x;t,x'), (4.19) 
= J exp (^-iY,Pix}j (5"(x(t + e) - x)) |^( 



K(t)=X ■ 
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The order of integration over x and averaging are interchanged to yield 

P{t + e,p\t,x') = ^ exp { -i'^piXi{t + e) | j , 

i=i J I 

n 



(4.20) 



exp 



i=l 



exp 



i=l 



Now, 



^exp (^-^V-etp^r>^{t?j ^ = J ^^^^ i^''^^'^} ("'^S''' 



(4.21) 



exp 



Inverting the Fourier transform, it follows that 



P{t + e,x\t,x')^ -^^-^ J {cTpjexp ( i^pi(xi-a;--e/i(x'))-ey ^ 

1 

■-— ^(xi-x^-e/i(x')) 



p1 , 



(4.22) 



\/(2^^M" 



1=1 



Partition the time interval [to,t] into N cqui-spaced time intervals so that ti = to + ie where 
e = {t — tQ)/N. Then, from the Chapman-Kolmogorov semi-group property it follows that 

P{t,x\to,xo) = J {(fxih) ■ ■ ■ (f'xitN-i)}Pit,x\tN-i,x{tN-i)) ■ ■ ■ P{ti,x{ti)\to,xo), (4.23) 



/ 1 n d^ti) 



N 



llP{ti,x{ti)\ti^l,x(ti-l)) 



.i=l 



where x{to) = xq and x{t]y) = x. Therefore, 

fx(t)=x 



P{t,x\toxo)= / [^x(t)]exp -—Siito, 

Jx{to)=xo L "■i^ 



t) 



where 



N n 



Siih, *) = 1™ ^ XI XI - Xi{tk-i) - ef{xi{tk-i))y 



(4.24) 



(4.25) 



k=l i=l 



i^xit)] = lim , ^ n 



d^'xiU) 



^-0 y(27r^^e)" fJi [ V(27r^^e)" j ' 
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Observe that the extra factor of 1/ \J {27rfiue)^ is due to N terms in the square bracketed term 
in Equation 4.23. In the continuum notation, this is 



P{t,x\to,xo) 



x(t)=x 



x{to)=XQ 



[Six{t)] exp 



t n 



I dr^(xi-/,(x(0)(t)))2 

*0 1=1 



(4.26) 



where the superscript (0) is to remind us that the drift is evaluated at the earher time. This 
is termed the "pre-point" version. 

This formula clearly differs from the formula (Equation 4.12| ) obtained using functional 
methods. In fact, another important difference is that in the Equation 4.26, the argument of 
/ is to be evaluated at the pre-point, rather than the mid-point (as in the case of Equation 



4.12| . This discrepancy is resolved once the pre-point form of the PI formula is converted to 



the symmetric form. 

In order to obtain the result in the Feynman convention, the argument of / needs to be 
symmetrized. It is straightforward to obtain a more general expression as follows. As evident 
from Equations 4.13 and 4.18 , typical values of x — x' ~ 0(ei/2). Let = x' + r{x - x'). 
Then, it follows that the leading order term in Xi — x'- — efi{x') is given by 



Xi-x',- efiix') = X, - - 6/,(xM) + re V(x,, - x',,) {x'^'^ ) + 0{e^){x) 

i^i 

The last term may be eliminated by introducing a new variable Xj defined as 

Xj = Xi re ^(xj' - x-,)^^(x'^''^). 
i<=i 

The Jacobian under this transformation is (d"x = det(3xj)/f?Xj)d"x) 



det 



exp 



2 ^Ox,^ ^ 

1=1 



(4.27) 



(4.28) 



(4.29) 



where the identity detvl = exp(trlnyl) has been used. Note that the transformation in 
Equation 4.28| does not change the x — x' term in Se{to,t) since the difference is 0(e^/^). 
Thus the overall exponent is 

N n 

S,{to,t) = ^mY,Yl 



k=l i=l 

Therefore, the result for the transition probability density is 



(4.30) 



P{t,x\to,xo) 



x{t)=x 



x(to)=xo 



1 



i^x(t)]exp ( -—Sito,t) 



where 



i=l •'^0 



dr. 



(4.31) 



(4.32) 
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When r = |, or the mid-point or Feynman discretization, agreement with the result obtained 
by functional methods is seen. The r = case is, of course, the pre-point version. The 
exponent, S'(t,to), is referred to as the action in physics literature. 

5. Path Integral Formula II: Explicit Time Dependence 

In this section, the general additive noise model is considered: noise is colored and the 
diffusion vielbein is a time-dependent rectangular matrix. Since the diffusion vielbein is no 
longer invertible, the method used in the previous section is not applicable. 
The state model is given by 

dx{t) = f{x{t),t)dt + e{t)dv(t), (5.1) 

where x{t),f{x{t)) £ W^,v>{t) G R^, and e{t) £ W^^p. Observe that drift is now an explicit 
function of time and the diffusion vielbein, apart from being a rectangular matrix, has an 
explicit time dependence as well. However, the diffusion term is assumed to be independent 
of the state vector; the more general state-dependent, or multiplicative noise, case is more 
subtle and will be studied in a future paper. Again, the Ito and Stratanovich forms are the 
same for this equation. A sample of this stochastic process satisfies the following differential 
equation 

p 

ii{t) = fi{x{t),t)+Y,eia{t)va{t), i = l,2,...,n, (5.2) 

a=l 

where the v is a sample of a Gaussian noise process with 5— correlated covariance matrix Q. 

It is assumed that the diffusion matrix is invertible; if not, it is physically plausible that 
results for the singular case are close to the "nearly singular" case, which is obtained by 
adding a small quantitity to render it invertible. 

5.1 Formal Derivation Using Functional Calculus 

The discussion here is similar to that in Section However, imposing a delta function 
constraint requires a different method (as in Section P) since e{t) is not a square matrix; and 
hence is not invertible. The path integral expression for the transition probability density is 



{d^[x{t) - x]) U(,„)=,„ = J [&p{uW W) - x\ Ut,)=.,, (5.3) 

= j [^z.(t)]exp j-i j ^a{t){Q-\t))abUb{t)dt \ x5[a;(t)-x]U(t„)=,„, 



a,b=l 

where x{t) satisfies the Langevin equation 



i{t) = fiix{t),t) + eia{t)Ua{t). (5.4) 



a=l 
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The Langevin equation condition can be imposed Equation C.IC to yield 



P{t,x\to,xo) = J [^i/(t)]exp 1^-^ J ^a{t){Q'\t))abn{t)dtj (5.5) 

r ^ 



X 6[x{t) -x]U(to)=a;o- 



Upon evaluating the functional derivative of the Langevin equation 



6xj{t') 



iit) - n{x{t),t) -Y,e^ait)Ua{t) 



a=l 



6{t-t'), (5.6) 



d_ 
It 



5,^5{t-t')-e{t-t')^{x{t')X) 



(5.7) 



it follows that the Jacobian J is given by 

J = det {^j^ det {5ij5{t - t') - e{t - t')^{x{t'),t')^ , 
= J^det (^6ij6{t - t') - e{t - t')^{x{t'),t'))^ , 

where ^ is an irrelevant constant that can be ignored (or absorbed into the measure). Hence, 



In J = In det 

n 



5,^5{t-t')-e{t-t')^{x{t'),t') 



(5.8) 



2 / dx. 



ix{t),t)dt. 



Also, following arguments in Equation 4.11 the measure [^i;(f)] can be replaced by [^j;(t)]. 
Thus, so far, 



x{t)=x 
x{to)=Xo 



P{t,x\to,Xo) = I [^x(t)] [^l^(t)]exp I / ^ ^ '^amQ-\t))abl^bit)dt | (5. 

i,j=l a,b=l 



n p 



X 



<5" (ii{t) - fMt),t) -Y^eUtHm exp (-i /■^M(x(t),t)dt 



2J ^dxi 

a=l / \ 1=1 



Using the Fourier integral version of the delta function, the transition probability density 



becomes 

P{t,x\to,xo) 



x{t)=x 



x{to)=xo 



[&x{t)] [&v{t)] [&\{t)] 



(5.10) 



X exp 



exp ii I X^Ai(t)(xi(t)-/,(x(t),t)-^eia(t)z^a(t))) 

V i=l a=l ) 

\\Y. ^a{tm-\t)Uv^{t)di \ xexp M / 

a,fe=l / \ i=l 



dxi 



{x{t),t)dt . 



Integration over ^{t) results in 

rx{t)=x 



P{t,x\to,xo) 



x(to)=xo 

X exp 



^x{t)] [&X{t)] exp 



dt 



2 ^ dx 

1=1 



(5.11) 



dt 



X^it)gij{t)X,it) -iYHt) {±i{t) - fi{x{t),t)) 



i-1 



Finally, integrating over Xi{t) leads to 

P{t,x\to,xo) 

where the "action" S is 



x{t)=x 



x{to)=Xo 



[&xit)]expi-S), 



S=^ I dt 



i=l 



Y iMt) - h{x{t),t))g-^{t) {x,{t) - f^{x{t),t))+Y^{x{t),t) 

i,j=l ■ ' 



(5.12) 



(5.13) 



5.2 Derivation Using the Chapman-Kolmogorov Semigroup Property 



Integrating the Equation 5.1 over an infinitesimal time interval {t,t + e) yields 

P rt+e 

Xi{t + e)=Xi{t) + eMx{t),t) + Y eia{t)ua{t) + 0{e^/^). (5.14) 

a=l 

Note, in particular, that replacing t with t + ae, in e{t) and /(x(t), t) (where a E (0, 1)) leads 
to errors of order higher than e (0(e^/^)) and hence are irrelevant in the continuum limit. The 
continuum Langevin equation (Equation |5.1| ) then becomes the following discretized Langevin 
equation: 



Xi{t + e) = Xi(i) + efi{x{t),t) + \/e ^ eial^a(^)• 



a=l 

As in the previous discussion, the following identification has been made 

P P 



f f rt+t 

\/e^eia(t)i>a(t) ^Y eia{T)UaiT)dT 
a=l a=l 



(5.15) 



(5.16) 
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Since 

{Ua{t)) = 0, (5.17) 
{Va{t)Ub{t')) = Qab{tW - t'), 

it is clear that 

p p 
e {e^ait)i>ait)i>bit')el{t')) = e {i>ait)Pbit')) ej^it), (5.18) 

a, 6=1 a, 6=1 

a,b=l 
t+e ft'+t P 



t Jt 



Hence, it follows that 



/t+e rf+e f 

* a, 6=1 

f//"^' Ea,6=l eiaiT)QabiT)eljdT, t = t' , 

\o, t^t' ' 

UElb=l^^air)Qabir)el^ir), t = t', 
[0, t^t' ' 



{i>a{t)) = 0, (5.19) 

{v'a{t)Ub{t')) =Qab{t)6tt', 

and i>{t) is a discrete-time Gaussian process. 

The Fourier transform of P{t + e, x; t, x') with respect to x is 

P{t + e,p\t,x') = J d'^xexp(^-iY^piXi^ P{t + e,x;t,x'). (5.20) 

Interchanging the order of averaging and integration over x leads to 

P{t + e,p\t,x') = Md^xexp l-i^PixA <5"(x(i + e) - x) \ , (5.21) 



i=l 



n 



^exp I -i^piXi(t + e) 



i=l 



Substituting the value of x(t + e) in Equation 5.15, 



P{t + e,p\t,x') = ^exp \^-iY,piXi{t + e)jj, (5.22) 
= exp -z^Pi(x- + e/i(x',t)) ( exp -z^e ^ ^Pieia(t)i>a(t) V 



i=l / \ \ j=l a=l 



20 



Now, 



n p 



exp -iVe^^pieia{t)va{t) 



(5.23) 



i=l a=l 
1 



n p 



^(27re)"detQ(t) 



n p 



i=l a=l 



a,b=l 



= exp -- ^ ^ Pi^i'='(*)Q'^b{t)e]^j{t)pj I . 

\ i j=l a,fe=l / 

Combining Equations 5.23 and ^.22 , it is evident that 

(n p n 

~2 ^ ^ Pieia(t)Qa6(t)e6jpj - + e/i(x',t)) j . (5.24) 

i,j=l a, 6=1 i=l 

Inverting the Fourier transform, 



P{t + e,x|t,x') 



^ y {f^>} exp^z^pi^i^ P{t + e,p;t,x'), 
If / e " ^ 

— ^ / {d>}exp "2 X] Pi(^ia{t)Qab{t)elj{t)pj 

\ i,j=la,b=l 



(5.25) 



exp I +i^pi(3;i -x[- efi{x',t))j , 

:exp(-5e(t + e,t)) 



1 



^(27re)"dete(t)Q(t)e^(t) 



where 



1 " 

S,it + e,t) = +- J2 



{xi - X - - efi{x',t)) 



a, 6=1 



(xj - x'j - efj{x',t)) 
(5.26) 



From this result, the finite time result can be calculated using the Chapman-Kolmogorov 
semi-group property of the transition probability density. Specifically, divide the time interval 
{t,to) into parts, i.e., e = (t — tQ)/N, and use Equation 4.23 to get 



J 



1 / d"x(to + ie) 



P(t, xlto, xq) = lim , ^ , , > ^ 

^-o 7 ^(27re)"det5(to) t\ [ ^(2^^)" det5(to + ie) 



exp -Se{to,t), 

(5.27) 
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where 



N n 



Si{t, *o) = lim ^ XI m [ (^i(^k) - Xi{tk-i) - efi{x{tk-i),tk-i)) 



(5.28) 



k=l i,j=l 



X 9ij (tk) (xjitk) - Xj(tk-i) - efj{x{tk-i),tk-i)) 



and 



I a,6=l 



(5.29) 



When taking the continuum hniit, the argument of / needs to be symmetrized in the Feyn- 
man form. Note that it suffices to symmetrize x in the argument since x — x' ~ 0{^/e). 
Symmetrization with respect to t, i.e., tk-i — > ^(tfc + ife-i), leads to an error higher than 0(e) 
and vanishes in the continuum hmit. 

Thus, setting x{tk) = ^{x{tk) + x{tk-i)), 

Xi{tk) - Xi{tk-i) - efi{x{tk-i),tk) = Xi{tk) - Xi{tk-i) - efi ( x{tk) - ^^^^~^\ tk 



(5.30) 

= Xi{tk) - Xi{tk-l) + - ^{Xj{tk) - Xj{tk-l))-K^ix,t). 



Define x as follows: 



Xi — X' 



6 ^ d f ' 



i=i 



(5.31) 



The Jacobian under this change of variables (from x to x) is 

-1 



det 



det 



exp 



h + 



2dxi 



"2^ 



(5.32) 



where the identity det(l + eA) expetr^ has been used. Hence, the overall action is 

, AT n 



k=li,j=l 



{xi{tk) - Xi{tk-i) - efi{xi^k,tk))9i/{tk) {xj{tk) - Xj{tk-i) + efj{xk,tk)) + 2^(^(^fc)'*fc) 



(5.33) 
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Therefore, in the continuum hmit, the transition probabihty density is given by 

fx(t)=x 



P{t,x\to,XQ) = / [&x{t)]exp{-SF{to,t)), 

Jx{to)=Xo 



where the action is 

1 " /■* 
SF{t,to) = -Y^ / 
^ i=i -^'o 



Mt) - Hx{t),t)]g;^\t)[x,{t) - fj{x{t),t)] + ^{x{t),t) 



dxi 



(5.34) 



(5.35) 



and 



N-l 



[&x{t)] = lim ■ 



J] U^x{to + ie)- 



-0 V(27re)"det5(to) t=\ I ^/ i27Te)^ det g{to + ie) 

It is clear that for the general discretization the action is 



(5.36) 



(5.37) 



Naively, it seems that the path integral formula depends on r, which contradicts the fact 
that the FPKfe and the Langevin equation do not depend on r. This is resolved in Section^ 
where it is demonstrated that the r dependence of the Feynman path integral formula is 
illusory, i.e., r dependence cancels. 



6. Verification of the Path Integral Formula 

In this section, it is independently verified that the path integral formulas derived in the 
previous sections are the fundamental solutions of the FPKfe. Clearly, it is sufficient to 
prove the path integral formula derived for the explicit time-dependent case and for arbitrary 
discretization. The method used closely follows the techniques first used by Feynman to verify 
the path integral formula he derived for the fundamental solution of the Schrodinger equation 
(see 0). 

From the Chapman-Kolmogorov semi-group property, it follows that 



According to Equation 5.25 and 5.37] 

P{t + e,x\t,x') ^ 



V(27re)"det5(0 



X exp 



E 



(6.1) 



(6.2) 



2 ^ ' 

1=1 
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Here x^^^ = x' + r(x — x'). 

The quantity P{t + e, x\t, x') is large only if 



x — X 



We may then write 



X = x' + ef{x^''\t) +r] 



or 



x' = X — ef{x^^^ ,t) — r] 



where the equalities are valid to 0(e). Substituting this into Equation p.2| , 
P{t + e,x\to,xo) = J exp ^-^ ^ j 



(6.3) 
(6.4) 
(6.5) 

(6.6) 



det g 



Next, the variable of integration is changed from x' to rj. Since 

rj = x' — X + ef{x' + r{x — x').t), 

so that 

+ r)eg^,{x ,t), 
and the Jacobian of the transformation from x' to rj is 



i=l 



(6.7) 
(6.8) 
(6.9) 

(6.10) 



Combining these results leads to 

/oo / 1 " 

exp I - — ^ Vi9^^{t)r]j 



(6.11) 



IE i) ^c-'i'o...) (i - (1 - .).|: If) ^ 



det (7 



i=l 



Y^(27re)" det g ' 
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where the Jacobian of the the transformation from x' to r] to 0(e) is included. 



The left hand side of Equation 6.11 is 



dP 

Pit, x|to, xq) + 2;|to, x^)- 



(6.12) 



Also, 



dP 1 d'^P 

P{t,x - ef{x,t) - ri\tQ,XQ) = P{t,x\tQ,xo) - e{fi{x,t) + r]i) — {t,x\to,xo) + -T?i7?-,- ^^^^ (t, x|to, xq), 

(6.13) 

where terms that contribute to the order of e have been retained. Therefore, the integrand 

/ " i9/j \ f dP 1 d^P 

[ 1 - 1 ( ^(*;^l*o,a;o) - e{fi{x,t) + r]i) — {t,x\to,xo) + -?7»?7i g^.g^ . (^; a:|to, a^o 



becomes, to 0(e), 



"9 1 (?^P 

P(t,x|to,a;o) - ^X]^ [/i(2;,*)^'(*,a;|to,xo)] + ^^^^ (t, x|to, xq). 



(6.14) 



(6.15) 



Here, the term linear in r/ has been dropped since its contribution to P{t + e, x\to, xq) vanishes 
upon integration over rj due to odd symmetry. 
Since 



\ «',i'=i / 



1 



V(2^e)"det5(t) 
1 

V(2^e)"det5(t) 



the right hand side of Equation p. 11 



IS 



(6.16) 



i=l 



P{t,x\tQ,xo) -<^^-Q^ [fi{x,t)P{t,x\tQ,XQ)\ + ^ 9ij{t) {t^A^Q^xo). (6.17) 



dxidxj 



Finally, note that 



P{t, x\t, x') = lim 



y^(27re)'" det 5(t) 
5"(x-xO. 



(1 " 



Xj - ) , (6.18) 



25 



Therefore, from Equations 5.12| , 5.17 , and 6.1g it follows that P{t,x\to,xo) is the funda- 
mental solution of the FPKfe: 



dP . 
'dt 



n Q ^ n 

(t, x|to, ^o) = - ^ ^ [fi{x, t)P{t, x\to,Xo)] + ^ X] 



i=l 



92 



2 ^ Sx-ax- i9ij{t)P{t,x\to,xo))] 
■,i=i * 



P(t,x|t, xo) = - xo). 



(6.19) 



7. Examples 



In this section, a few examples are presented illustrating the utility of the path integral 
formulas derived in this paper. For simplicity, all examples are one-dimensional. Note that 
an accurate determination of the transition probability density automatically implies that 
the continuous-filtering problem can be solved very accurately. It is therefore sufficient to 
demonstrate that the transition probability density can be computed accurately. 

From the discussions in the previous sections, the one-step r = 0, or 'pre-point', approx- 
imate formula is 

1 



P{t",x"\t',x'' 



X exp 



y/{27r{t" -t))detg{t') 
{t"-t') 



(7.1) 



E 



x" — x' 



{x" - x') 

it"-t') 



The one-step r = ^, or symmetric, approximate path integral formula for the transition 
probability amplitude is 



P{t",x"\t',x" 



^{27T{t" -t))detg{t) 



(7.2) 



{t" - t') ^ 
X exp I ^ 2^ 



x" — x' 



t" - 1 



T-h{x,t) 



2(2;" + x') and t = ^{t' + t'' 



9^ it) 



{x" — x') 
{t"-t') 



fj{x,t) 



i=l 



where x — ' -'^ ---^ +_!/'+' 1 +"^ 



Example 1 (Wiener Process) The simplest example is the following state model: 

dx{t) = adv{t). (7.3) 

The one-step formula for this case is 

1 / (x" - x')2 



p{t",x"\t',x') 



y/2TTa'^{t" - t') 



exp 



2a^{t" -t') 



(7.4) 



This formula is well known to be exact for arbitrary size time step\2(\]. The extension to the 
n— dimensional case is straightforward. 
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Example 2 (Linear Model) Consider the following state model: 

dx{t) = -0.2x{t)dt + Vwdv{t). (7.5) 

The pre-point one-step formula is seen to be the lowest order approximation to the exact 
solution]^]. Thus, this approximation is accurate for small time steps. 



(7.6) 



Example 3 (A Polynomial Nonlinear Model) Consider the model 

dx{t) = -4x{t){x^{t) - l)dt + O.ldv(t). 



The results for the transition probability density for x' = —0.75, —0.35, —0.05, 0.30, 0.70 are 
plotted in Figure^ along with values obtained from 10^ simulations. The time step size is 0.1. 
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Figure 1: Transition probability density for a nonlinear state model with polynomial drift (Equation 
T^. Here t" - t' = 0.1 and x' = -0.75, -0.35, -0.05, 0.30, 0.70. 



Example 4 (A Transcendental Nonlinear Model) Consider the model 

dx{t) = 1.2cos{3x{t))dt + 0.3dv(t). 



(7.7) 



The symmetric one-step approximate path integral formula for the conditional probability den- 
sity is plotted in Figure |^. The time step size is 0. 2. 
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These examples illustrate that some of the simplest one-step path integral approximation 
formulas are often very accurate. Clearly, the smaller the time steps, the more accurate is 
the approximation. What is remarkable is to note that the time step need not be infinitesimal 
for extremely accurate results. In practice, the time steps are often small, and the one-step 
approximations discussed here might be adequate for many applications. For larger time 
steps, there are better one-step approximations that have been obtained and will yield better 
performance. 

Finally, observe that time dependence is not a problem provided the variation of the 
diffusion vielbein is small over the time interval {t" — t'). Of course, this means that the 
transition probability density depends on the actual times {t" and t'), rather than just their 
difference (as in the time-independent case). 

8. Additional Remarks 

8.1 Physical Meaning of the Path Integral Formula 

A beautiful discussion of the path integral in the quantum mechanical context is given in 
[ |T^ ] . Let us consider the model discussed in Section ^ Intuitively, the path integral is a sum 
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of contributions over all paths satisfying the boundary conditions: x(to) = ^^o and x{t) = x. 
The action is computed for each path, and the exponential of the action is the contribution 
from that path. In other words, the contributions of the paths are exponentially weighted 
according to the action. It is clear that the dominant contribution comes from paths near the 
path of least action. 

This heuristic picture is seen to be consistent with intuition in the limit the noise is 
small. Let us consider the model discussed in Section ^ and assume that the noise is absent. 
Then, the evolution of the state is governed by a deterministic dynamical equation — the state 
evolves according to the zero noise dynamical law: 



±i{t) = fi{x{t)), i = l,2,...,n. (8.1) 
According to the path integral formula. Equation |4.12| , when the noise is small, action is small 



only if the ordinary differential equation (ODE) Equation 8.1 is satisfied (or nearly so). This 



implies that, to a good approximation, the particle moves along the deterministic trajectory 



specified by Equation |8.1| , as one would intuitively expect. The path integral formula gives a 
precise sense in which the deviation from the deterministic path is probabilistically possible 
as the variance of the noise, h^^, increases. This is analogous to the situation in quantum 
physics, where the role played by hi, is played by the Planck's constant h. 

8.2 Comments on the integral of S-T. Yau and S. S-T. Yau 

A solution in terms of ordinary integrals was presented in . The formal solution derived by 
S-T. Yau and S. S-T. Yau is of the form 

u{t,x)= / (2^i)"/^ (8.2) 



n „i n 



\xi - yi)fiiy + t{x,y))dt 



1 + ^ai(x,y)f 



i=l 



for arbitrary initial condition n(0, x) = (Jo{x) (for details see [^). The important thing to 
note is that this is an infinite series in t. In addition, they present an estimate of the time 
interval on which their solution converges. In contrast, the path integral solution is infinite- 
dimensional integral but with a much simpler structure^. 

Also, note that the result in Q differs from that obtained here in a few aspects. Firstly, 
they do not assume current conservation, and so solve a more general equation than the 
FPKfe. Secondly, the derivations here assume that the signal model noise is additive, whereas 
in a weaker condition is assumed, namely that the diffusion matrix is orthogonal. However, 
unlike M the FPKfe is solved for the explicit time-dependent case. 



It is also important to note that they have represented an infinite dimensional path integral as a finite- 
dimensional ordinary integral. This may be useful in other areas, such as quantum mechanics and field theory. 
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9. Conclusion 

In this paper, the path integral formula has been derived for the Fokker-Planck-Kolmogorov 
forward equation that arises from the explicitly time-dependent Langevin dynamical equation 
with additive noise with a rectangular diffusion vielbein. This can be applied to solve the 
continuous-discrete filtering problems very accurately. 

The path integral formulation has several advantages. It has been successfully used 
to solve challenging problems in quantum field theory not solvable using other approaches. 
Furthermore, from a conceptual point of view, the path integral formulation has proven to 
be extremely valuable. The insights from path integrals to problems in several other fields, 
especially quantum mechanics and field theory, has been tremendous. 

Hence it is a very promising framework for studying the nonlinear filtering problem. 
Numerous numerical methods follow from the path integral formulas derived here. This is 
being utilized in subsequent work. 

A. Gaussian Integrals 

It is remarkable that the foundation of the sophisticated methods of path integrals is based on 
Gaussian integrals. A Gaussian integral is defined as one where the integrand is an exponential 
function with an exponent quadratic in the variables. The simplest Gaussian integral 




(A.1) 



can by evaluated by noting that 




(A.2) 



Transforming to polar coordinates with the substitutions 



X = r cos 9, < r < oo, 
y = rsin6', < 6' < 27r, 



(A.3) 



so that the measure becomes dxdy = rdrdO, it follows that 




(A.4) 



= TT. 



Hence, 




(A.5) 
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or more generally (assuming a > so that the integral is convergent), 

/ e-"^ dx= e"^ z = V^x, (A.6) 



The general one-dimensional Gaussian integral can be evaluated by completing squares: 

^_^ax^+hx+c)^^^^-c e-"K"+^) -3^Jd:c, (A.7) 

-oo J —oo 



a 



Define the n— dimensional measure 

{d^x} = dxidx2 ■ ■ ■ dxn- (A.8) 

Then, it follows that 

/ /•oo 

{d"x}e"^^=i"^''*(ia; = JJ / dx^e""^^?, (A.9) 



> 0. 



Next, consider the quadratic form 



a ^2 XiAijXj, (A. 10) 

where a is a positive real number and ^ is a real nxn symmetric matrix^. A real, symmetric 
matrix can always be diagonalized by an orthogonal transformation O: 

A = O^A(A)0, (A.13) 



'^Note that any matrix can be written as a sum of a symmetric matrix(yl'^) and an antisymmetric matrix 
because of the identity 

A = - I - = A + j4 , A^j = Ajf, Aj^j = —Ajf. (A. 11) 

However, since Xi are commuting variables, 

n n 

XiAfjXj = XjAfiXi, (i ^ j) (A. 12) 

i,j=l i,j=^ 

n n 
— ^ Xjj^ijXi — ^ X'i-A.ijXj — 0. 

Hence, without loss of generality, A may be assumed to be a symmetric matrix. 
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Here, A(^) is a diagonal matrix with its diagonal entries being the eigenvalues \i{A) of A. 
Thus, 



a ^ XiAijXj = a ^ yikij{A)yj, 



(A.14) 



n n 

a \i{A)yf, yi = YOijXj. 

i,j=l j=l 



Since the Jacobian is unity (detO = 1), 

_ /7r\"/2 1 

/7r\n/2 1 

= - , a>0. 

Va/ VdetA 

The most general quadratic form can be written as follows: 

n n 

a(3 ^ XiAijXj+[5 ^ biXi + c = 



(A.15) 



(A.16) 



i,i=l \ A:=l / V fe=l / 



4q; 



Under the substitution 



Di — 



(A.17) 



k=l 



dyi = dxi and the limits of integration are unchanged. Therefore, the general n— dimensional 
Gaussian integral is 



Z{A, b,c) = J {(Tx} e-^[" ^^^=1 ^*^*^^^+Sr=i biXi+c/p] ^ 



(A.18) 



, n/2 



a/3/ Vdet^ 



The normalized measure 



[cTx] = 



IT 



n/2 



(A.19) 
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is defined so that 



(A.20) 



1. 



In the calculation of the expectation values of polynomials in random vectors with a 
Gaussian distribution, integrals of the form arise: 



-. (A.21) 



From the Equation A. 18, it follows from repeated differentiation with respect to Pb, that 



vr 

-D' d d 



Vdet A 



-1)' d d 



d 



(3^ dbk^ dbk2 dbk^ 
In fact, if F{xi,X2, • • • , Xn) is a power series in xf. 

Id Id 



/?' dbk^ dbk^ dbk, 



Z{A,b,0) 



6=0 



(A.22) 



6=0 



(F(xi,X2, 

An example is the two-point function: 



6=0 



(A.23) 



^ /32 2a ^ ' 



(A.24) 



1 



2al3 



The expectation can be simplified further. Note that if / is odd, the expectation vanishes. 
When / is even, it is easy to show (using the method of induction) that: 



Permutations P of {ki,k2,...,ki\ 



2a I "''Pi^P2 



kpi^lkp^^ 



(A.25) 



E 



Permutations of {ki,k2,...,ki} 



\Xp,Xk^,J---{^Xk^^^^Xk 



This can be formally generalized to an infinite number of variables, i.e., fields, and is then 
referred to as the Wick's theorem for bosonic fields in quantum field theory. 
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B. Delta- Function and Constraints 



The delta function^ has several representations. The most useful representation for our pur- 
poses is the following representation of the n— dimensional delta function 

5^{x - x') = J {d^k} ^T-=,kM-^^^ (B.l) 

= J [(Tk] e^E?=ifei(^-^^). 

The Fourier transform and its inverse are defined as follows: 

fik) = I {d"x}e-^^"=i'=^->/(x), (B.2) 

Consider the problem of writing down the expression of a function (t{x) at the point 
where f{x) = without actually solving the constraint equation. Assume that the solution 
to the equation f{x) = is unique. Then, 

o-{x)\f(x)=o = o-{xc), f{xc) = 0, (B.3) 
{(Tx} 5{x — Xc)(t{x). 



I 



Prom the sequence of identities 

1 = J {ry}6^\y), (B.4) 



= J {ctx} J{x)6-{f{x)), J = det (^^(x) 

{d''x}6''(x-xc), 



it follows that 



r(/(.)) = ^^. (B.5) 

S'(x-x,)=S'(f(x))J(x). 



^Strictly speaking, the delta function should be called the delta distribution defined rigorously as a limit 
of sequence of certain functions; it is not mathematically meaningful to view the Dirac delta function as a 
function. However, for most of our applications, the manipulations done here are valid if the delta function is 
always considered to be a part of the integrand. 
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Hence, 



a{x)= / Ka;}5"(/(x))J(x)^(x). 



(B.6) 



Using the Fourier integral representation of the delta function in Equation B.l, it is clear that 
a(x)|;(.)=o = J {rX} e^^/W J(x)a(x). (B.7) 

C. Functional Calculus: A Brief Note 

Recall that a function f{x) gives a number for each point x. The infinite dimensional gener- 
alization of a function is called a functional, f[x{t)], which gives a number for each function 
x{t). Loosely speaking, the functional derivative may be defined analogous to the ordinary 
derivative as follows: 



5x{t') e^o e 



(C.l) 



Alternatively, the functional derivative, -g§(trj{x{t)) ■, of the functional f{x{t)) with respect to 
variation of the function x{t) at s is defined by the following equation 



f[x{t)+rj{t))=f[x{t)) + 



Sx(s' 



— {x{s))rj{s')ds' H . 



(C.2) 



Note that the functional derivative of a functional is also a functional. Functional differenti- 
ation obeys the standard algebraic rules (linearity and Leibnitz's rule): 



6x{t') 



.i=l 



^ [h{x{t))f2{xm = f,{x{t))^^h{x{t)) + f,{x{t))^^h{x{t)). 



(C.3) 
(C.4) 



Just as the derivative of x with respect to x is 1, the functional derivative of x{t) with respect 
to x(t') is the unit matrix in infinite dimensions, namely the delta function: 



5x{t) 
6x{t') 



6{t-t'). 



(C.5) 



The functional derivatives of a formal power series in function x{t) can be seen to be analogous 
to the ordinary derivative of a power series in the variable t. Specifically, it is straightforward 
to verify that if 



oo ^ 

f{x{t)) = Y.— dti---dtn&\tiM,---,tn)x{ti)---x{tn) 
n=0 ^' 



(C.6) 
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then 

(C.7) 



\l=l 



x=0 



In particular, a functional representable as a power series in functions, like the exponential 
functional, can be functionally differentiated using this result. Thus 



■ exp dx' J{x')A{x')^ = 74(x) exp dx\ 



6J{x) 

The functional delta function may be written as 



J{x')A{x') . (C.8) 



^ifixit)) = / [dX{t)] expii dtXit)f{xit)) . (C.9) 



Note that Equation becomes 

5/ 



1= / [^/(x(t')))det 



6x{t') 



Sifixit))). (C.IO) 



In the application at hand, determinants of operators (matrices) arise in Gaussian inte- 
gration in the infinite (finite) dimesional case. Of particular interest is the operator det(<5(x — 
y) + K{x,y)), for some operator K(x,y), where it is assumed that traces of all powers of K 
exist. From the identity 

IndetM = trlnM, (C.ll) 

it follows that 

lndet[l + i^]= J dxK{x,x) - ^ j dxidx2K{xi,X2)K{x2,X3) ^ h (C.12) 

/ dxi - ■ ■ dXnK(xi,X2)K(x2,Xs) ■ ■ ■ K(Xn,Xi) -\ . 

n J 

Note that if K{x,y) = 6{x — y)K{x,y), only the first term is nonzero. 
References 

[1] A. H. Jazwinski, Stochastic Processes and Filtering Theory. Academic Press, 1970. 

[2] R. E. Kalman, "A new approach to linear filtering and prediction problems," Trans. ASME, J. 
Basic Eng., vol. 82D, pp. 35-45, March 1960. 

[3] R. E. Kalman and R. S. Buoy, "New results in linear filtering and prediction problems," Trans. 
ASME, J. Basic Eng., vol. 83, pp. 95-108, 1961. 

[4] T. Kailath, A. H. Sayed, and B. Hassibi, Linear Estimation. Prentice-Hall, Inc, 2000. 

[5] V. Thomee, Handbook oj Numerical Analysis. North-Holland, 1990, vol. 1, eh. Finite difference 
methods for linear parabolic equations, pp. 5-196. 



- 36 - 



G. I. Marchuk, Handbook of Numerical Analysis. North-Holland, 1990, vol. I, ch. Splitting and 
Alternating Direction Methods, pp. 197-462. 

C. Canute, M. Y. Hussaini, A. Quarteroni, and J. T. A. Zang, Spectral Methods: Fundamentals 
in Single Domains. Springer- Verlag, 2006. 

V. Thomee, Galerkin Finite Element Methods for Parabolic Problems, ser. Springer Series in 
Computational Mathematics. Springer- Verlag, 1997, vol. 25. 

S.-T. Yau and S. S.-T. Yau, "Explicit solution of a kolmogorov equation," Applied Mathematics 

and Optimization, vol. 34, no. 3, pp. 231-266, 1996. 

P. A. M. Dirac, "The lagrangian in quantum mechanics," Physikalische Zeitschrift der 
Sowjetunion, vol. 3, pp. 64-72, 1933. 

, The Principles of Quantum Mechanics, 4th ed. London, UK: Oxford University Press, 

February 1982. 

R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals. McGraw-Hill Book 
Company, 1965. 

R. P. Feynman, "Space-time approach to non-relativistic quantum mechanics," Reviews of 
Modern Physics, vol. 20, pp. 367-387, 1948. 

W. Sicgcl, "Fields," 1999. [Online]. Available: 

http: / /www.citebase.org/abstract?id=oai:arXiv.org:hep-th/9912205 

J. Glimm and A. Jaffe, Quantum Physics: A Functional Integral Point of View, 2nd ed. New 
York, USA: Springer- Verlag, May 1987. 

H. J. Rothe, Lattice Gauge Theories: An Introduction, 3rd ed. World Scientific, May 2005. 

H. Risken, The Fokker-Planck Equation: Methods of Solution and applications, 2nd ed., 
H. Haken, Ed. Springer- Verlag, 1999. 

J. Zinn-Justin, Quantum Field Theory and Critical Phenomena, ser. Int. Ser. Monogr. Phys. 
Oxford University Press, 2002. 

Z. Liang and S. S.-T. Yau, "Wavclct-galerkin method for the kolmogorov equation," 
Mathematical and Computer Modelling, vol. 40, no. 9-10, pp. 1093 1121, Nov. 2004. [Online]. 
Available: http: / / www.sciencedirect.com/science/article/B6V0V-4FWJGTY- 
H/2/099a028ded368f222d8dfc6353898501 

[20] A. T. Bharucha-Reid, Elements of the Theory of Markov Processes and Their Applications. 
McGraw-Hill, 1960. 



-37- 



